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In 2004 the Dutch Department of Social Affairs conducted a sur- 
vey to assess the extent of noncompliance with social security reg- 
ulations. The survey was conducted among 870 recipients of social 
security benefits and included a series of sensitive questions about 
regulatory noncompliance. Due to the sensitive nature of the ques- 
tions the randomized response design was used. Although random- 
ized response protects the privacy of the respondent, it is unlikely 
that all respondents followed the design. In this paper we introduce a 
model that allows for respondents displaying self-protective response 
behavior by consistently giving the nonincriminating response, irre- 
spective of the outcome of the randomizing device. The dependent 
variable denoting the total number of incriminating responses is as- 
sumed to be generated by the application of randomized response to 
a latent Poisson variable denoting the true number of rule violations. 
Since self-protective responses result in an excess of observed zeros in 
relation to the Poisson randomized response distribution, these are 
modeled as observed zero-inflation. The model includes predictors of 
the Poisson parameters, as well as predictors of the probability of 
self-protective response behavior. 

1. Introduction. In 2004 the Dutch Department of Social Affairs con- 
ducted a nationwide survey to assess the level of compliance with the Un- 
employment Insurance Act. Under this act employees who have lost their 
income due to unemployment are entitled to financial benefits, provided 
that they comply with the rules and regulations stipulated in the act. The 
participants in the survey were asked if they had ever violated against the 
regulations in the year preceding the survey. Since the disclosure of a rule 
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violation may have serious financial consequences for the respondent, the 
randomized response design was used. 

The randomized response method was first introduced in 1965 by Warner 
as an interview technique that protects the respondents' privacy [Warner 
(1965)]. In Warner's design the respondent is presented with two comple- 
mentary statements, for example "I am a marihuana user" and "I am not a 
marihuana user." The respondent then operates a randomizing device, like 
a pair of dice or a deck of cards, and the outcome of this device determines 
which of the two statements the respondents has to answer. Since only the 
respondent knows the outcome of the randomizing device, confidentiality is 
guaranteed. 

A meta-analysis of randomized response studies shows that the random- 
ized response design generally yields higher and more valid prevalence esti- 
mates of the sensitive characteristic than direct-questioning designs 
[Lensvelt-Mulders et al. (2005)]. However, a number of studies suggest that 
respondents do not always follow the instructions of the randomized 
response design. In an experimental randomized response design 
[Edgell, Himmelfarb and Duncan (1982)] with the outcomes of the random- 
izing device fixed in advance, about 25% of the respondents answers no to 
a question about having had homosexual experiences, while according to 
the design these respondents should have answered yes. In another experi- 
mental study [van der Heijden et al. (2000)] all respondents were known to 
have offended against social security regulations. Although the randomized 
response condition yielded higher estimates than the direct question design, 
the prevalence estimate of offenders obtained with randomized response was 
only about 50%. Another study involved an interview of participants in a 
randomized response survey [Boeije and Lensvelt-Mulders (2002)]. Many of 
the participants indicated that they had found it difficult to falsely incrimi- 
nate themselves when they were forced to do so by the outcome of the dice. 
Some of them admitted that in this situation they had given the nonincrim- 
inating answer instead. 

A recent topic of investigation in the field of randomized response is the 
estimation of evasive response bias. Clark and Desharnais (1998) show that 
the presence of evasive responses can be detected in a randomized response 
design with two groups that each use a randomizing device with different 
outcome probabilities. Kim and Warde (2005) present a multinomial ran- 
domized response model taking evasive response bias into account in designs 
with a sensitive question with multiple response categories that increase in 
sensitivity. The term self-protection (SP) was introduced by Bockenholt and 
van der Heijden (2004, 2007) to describe the responses by respondents who 
consistently give the evasive answer, without taking the outcome of the ran- 
domizing device into account. According to this definition, the SP response 
profile consists of nonincriminating (i.e., no) responses only. The authors 
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use models from item response theory to obtain prevalence estimates of the 
sensitive characteristics corrected for SP. The SP assumption is also used 
in log-linear randomized response models that study the association pat- 
terns between the sensitive characteristics and obtain prevalence estimates 
corrected for SP [Cruyff, van den Hout, van der Heijden (2007)]. 

The definition of SP implies that the probability of an evasive response 
does not explicitly depend on the sensitivity of the question or on the 
true status of the respondent. Although it is possible to formulate more 
complex assumptions with respect to the generation of evasive response 
bias, SP seems to provide an adequate description of the process. A study 
[Bdckenholt, Barlas and van der Heijden (2008)] modeling evasive response 
behavior in randomized response as a function of both the sensitivity of the 
question and the true status of the respondent found no compelling evi- 
dence for the superiority of these models in relation to the corresponding 
SP models. 

In this paper we introduce a regression model that allows for SP in ran- 
domized response sum score data. The model assumes a Poisson distribution 
for the true sum score variable assessing the individual number of sensitive 
characteristics. The model further assumes that the observed sum score vari- 
able denoting the number of incriminating responses is partly generated by 
the randomized response design, and partly by SP. Since SP by definition 
results in an observed sum score of zero, the distribution of the observed 
sum score variable is zero-inflated with respect to the Poisson randomized 
response distribution of the true sum score variable. The model allows for 
predictors that explain individual differences in the Poisson parameters, as 
well as predictors that explain individual differences in the probability of 
SP. Since the distribution of the observed sum score variable is a mixture 
of a Poisson randomized response distribution and observed zero-inflation, 
the model is called the zero-inflated Poisson randomized response regression 
model. 

The model is applied to randomized response data from a social security 
survey conducted in the Netherlands in 2004. Section 2 describes the data. 
Section 3 derives the zero-inflated Poisson regression model based as an 
extension of existing randomized response models for multinomial and sum 
score data. The section also includes a description of a maximum likelihood 
(ML) estimation procedure and an evaluation of the validity of the Poisson 
assumption with respect to the true sum score variable. The results for the 
social security data [Cruyff, Bckenholt, van den Hout and van der Heijden 
(2008b)] are presented in Section 4. Section 5 discusses some assumptions 
and interpretations of the model. 

2. The data. In 2004 the Department of Social Affairs in the Nether- 
lands conducted a nationwide survey to assess the level of noncompliance 



4 



CRUYFF, BOCKENHOLT, VAN DEN HOUT AND VAN DER HEIJDEN 



with the Social Security Law [compare Lensvelt-Mulders et al. (2006)]. The 
survey includes 870 participants who receive financial benefits under the 
Unemployment Insurance Act (UIA). Persons who have become (partially) 
unemployed are eligible for benefits. A beneficiary receives about 70% of the 
last earned wages, and the duration of the benefits depend on the length 
of the persons' employment history. Beneficiaries are required to report all 
activities that generate income in addition to their benefits or that might 
conflict with reintegration into the labor market. The failure to report such 
an activity may be sanctioned. 

The social security survey includes the following five questions assessing 
noncompliance with UIA regulations: 

1. Have you in the past 12 months ever had a job or worked for an employ- 
ment agency in addition to your benefit without informing the Depart- 
ment of Social Services? 

2. Have you in the past 12 months ever refused to accept a suitable job, or 
have you ever deliberately made sure you were not hired even though you 
had a chance of getting the job? 

3. Have you in the past 12 months ever deliberately put in an insufficient 
number of job applications for a sustained period of time? 

4. Have you in the past 12 months attended any day courses without in- 
forming the Department of Social Services? 

5. Have you in the past 12 months had any income in addition to your ben- 
efit, for example, from alimony, a scholarship, subletting, other benefits, 
gifts, interest and so forth, without informing the Department of Social 
Services? 

Due to the sensitive nature of the questions, the randomized response method 
is used. The respondents answer the questions with the use of a computer 
according to the forced response design [Boruch (1971)]. Before answering 
the question the respondent throws two virtual dice, and is instructed to an- 
swer yes if the sum of the dice is 2, 3 or 4, and to answer no if the sum of the 
dice is 11 or 12. If the sum of the dice is 5, 6, 7, 8, 9 or 10, the respondent has 
to answer the question truthfully. The misclassification probabilities, that 
are conditional on the true status of the respondent, can be derived from the 
probability distribution of the sum of two dice. Given regulatory noncom- 
pliance, the probability of a yes response is 11/12 and that of no response 
1/12. Given regulatory compliance, the probability of a yes response is 1/6 
and the probability of a no response 5/6. In the actual social security sur- 
vey, however, the programmer inadvertently programmed the virtual dice 
so that the probability of a yes response given regulatory noncompliance 
was 0.9329, and that of a yes response given regulatory compliance 0.18678. 
The number of observed yes responses to the five questions are respectively 
122, 195, 168, 207 and 274. Counting the total number of yes responses 
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for each respondent on the five questions yields the frequencies uq = 288, 
n\ = 295, n2 = 207, ?i3 = 68, ?i4 = 7 and 77,5 = 5 (with the subscript denoting 
the number of observed yes responses). 

The social security survey includes two kinds of predictors we like to 
explore, one concerning demographic variables and the other concerning 
variables related to the forced response design. The demographic variables 
gender, age, year unemployment, education and knowledge rules are used as 
predictors of regulatory noncompliance. The variables gender and age are 
dummy-coded with "male" (n = 483) and "older than 26" (n = 832) as re- 
spective reference categories. The variable year unemployment is a dummy 
variable denoting the last year of being employed, with the year 2004 as 
reference category (n = 257). The variable education (mean = 2.25, sd = 
0.67) measures increasing levels of eduction. The variable knowledge rules 
(mean = 3.8, sd = 0.90) denotes on a 5-point scale of the respondents' gen- 
eral knowledge of the social security regulations. The two variables trust 
and understanding are related to the forced response design and are used as 
predictors of SP. The variable trust (mean = 3.5, sd = 0.92) is constructed as 
the average score on four 5-point scale variables (Cronbach's Alpha = 0.87) 
assessing different aspects of the respondents' beliefs in the confidentiality 
and privacy protection of the forced response design. A high score on this 
variable corresponds to a high degree of trust. The variable understanding 
(mean = 4.2, sd = 0.85) assesses on a 5-point scale to what extent the respon- 
dent feels that he understood when to answer yes and when to answer no to 
a forced response question. High scores correspond to a good understanding 
of the forced response design. 

Figure 1 depicts the associations between the observed sum scores and the 
predictors. At this point we would like to emphasize that the plots should 
not be interpreted as depicting associations between the predictors and the 
true sum scores (i.e., the number of rule violations), since the observed 
sum scores are not corrected for the misclassification due to randomized 
response, nor for SP. The plots at the top of the figure show the observed sum 
score proportion conditional on the categories within the dummy variables 
gender, age and year unemployment. The profiles of males and females look 
similar. The plot for age shows that the proportion of zeros for the younger 
respondents (about 15%) is about half that of the older respondents. The 
younger respondents also have a relatively high proportion of ones (about 
45%). The profiles within year unemployment are again relatively similar, 
although persons who became unemployed in 2004 have a higher percentage 
of zero response (about 40%) compared to the respondents who became 
unemployed before 2004 (30%). The four plots at the bottom show the mean 
predictor scores within the observed sum scores for the respective continuous 
variables education, knowledge rules, trust and understanding. The predictor 
means do not show any clear linear associations with the observed sum score, 
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Fig. 1. Observed sum score proportion given the categories within the dummy variables 
gender, age and year unemployment (upper plots), and mean predictor score within ob- 
served sum score for the continuous variables education, knowledge rules, trust and under- 
standing (lower plots) . 



although for the variable knowledge rules the means seem to slightly decrease 
with increasing sum scores. The effect of sum score is most pronounced on 
the means of the predictor trust, but the association pattern is erratic. 

3. The model. 

3.1. The multinomial randomized response model. Consider a random- 
ized response design with M sensitive questions, each assessing the presence 
or absence of a sensitive characteristic. Let the random variable denote 
the observed response to the mth question, with y* m G {0 = no, 1 = yes} and 
m £ {1, . . . ,M}. Similarly, let Y m denote the true status with respect to the 
sensitive characteristic, with y m € {0 = absent, 1 = present}. The binomial 
randomized response model for the binary variable is given by 

l 

(1) ny m = V* m )= E Py* m \v m *y m , 

J/m=0 

where 7r y = F(Y m = y m ) and Py* m \ Vm = P(Y£ = y* m \Y m = y m ) are the condi- 
tional misclassification probabilities that can be derived from the probability 
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distribution of the randomizing device. For a more detailed discussion of this 
model, we refer to Chaudhuri and Mukerjee (1988). 

Next consider the true sum score of the M sensitive characteristics, de- 
noted by the variable 



M 



(2) S=Y,Ym. 



It 1=1 

If S follows a multinomial distribution with parameters ttq, . . . , ttm > then the 
multinomial randomized response model 

M 

(3) F(S* = s*)=J2ls*\sKs, 

s=0 

applies, where S* denotes the number of observed yes responses on the M 
sensitive questions and q s *\ s =F(S* = s*\S = s), for s,s* G {0, . . . ,M}. 

The misclassification probabilities q s *\ s , that exist if and only if the p y * \ Vm 
are the same for all m, can be derived as the multinomial probabilities 



( 4 ) i s \t=E Q Q ( a +jl t )pl\M\i^o *<o" 



for s, t € {0, 1, . . . , M} and t < s + j < M [Cruyff, van den Hout, van der Heijden 
(2008b)]. 

As an illustration, consider the forced response design of the social security 
survey with two binary variables Y\ and Yi. Application of (4) for M = 2 
yields the misclassification probabilities q s *\ s : 

'Qo\o Qo\i Qo\2\ ( Po|o Po|oPo|i Po|i 

9i|o 9i|i <7i|2 = 2 Po|oPi|o Pi|oPo|i+Po|oPi|i 2 Po|iPi|i 

,?2|0 92|1 ?2|2/ V Pl|0 Pl|0Pl|l Pl\l 

3.2. The Poisson randomized response model. Assume that the true sum 
score S is generated by a Poisson process with parameter A. Since realiza- 
tions of S are limited to the maximum value of M, the Poisson distribution 
of S is truncated at the right [Cameron and Trivedi (1998)], so that 

(5) ¥(S = s\s<M) 



with 

exp(-A)A 5 

(6) TT S = 
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Substitution of the multinomial probabilities ir s in model (3) for the ex- 
pression at the right-hand side of (5), with tt s defined as in (6), yields the 
(right-truncated) Poisson randomized response model 

M 

(7) HS* = s*\s*,s<M) = J2 Vs*\s^m^- 

3.3. The zero-inflated randomized response regression model. Count data 
are often characterized by an excess of zeros relative to a Poisson distribu- 
tion. To account for the excess of zeros, Lambert (1992) introduced the 
zero-inflated Poisson (ZIP) model 

(8) F(S = s) = (1-9)tt s + I9, 

with S € {0, 1,2,...}, ir s defined as in (6), and / an indicator variable taking 
on value 1 if S = 0, and otherwise. The parameter 9 denotes the probability 
of an excess zero in the observed counts, that is, a zero count that is not 
generated by the Poisson process. 

Now suppose that in the context of randomized response the true sum 
score variable S is generated by a Poisson process. In the absence of SP 
the observed sum score variable S* is entirely generated by the Poisson 
randomized response process. In the presence of SP, however, S* is generated 
partly by the Poisson randomized response process and partly by SP. Let the 
parameter 9* denote the probability that the observed sum score is generated 
by SP, and let 1 — 9* denote the probability that the observed sum score is 
generated by a Poisson randomized response process. The distribution of S* 
is then given by 

M 

(9) F(S* = s*\s*,s< M) = (1 - 9*) ]T ga .|,-JJf— + 1*9*, 

s=0 l^s=0 71 's 

where I* is an indicator variable taking on the value 1 if S* = 0, and 
otherwise. 

Both parameters A and 9* in (9) can be modeled as a function of predic- 
tors. Let variable Si denote the true sum of sensitive characteristics of indi- 
vidual i, for i G {1, . . . , n}, and let Xj = (xio, . . . , Xik)' and Zj = (zio, . . . , Zu)' 
be vectors that may or may not contain the same predictors. Let the Poisson 
parameter of individual i depend on Xj according to 

(10) A l = exp(x / t /3), 

and let the probability of zero-inflation depend on Zj according to 
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where f3 = (/3o, • ■ • , @k)' and 7 = (70, . . . , 7/)' are parameter vectors. The ZIP 
randomized response regression model is given by 

M 

(12) F(S* = s*\s*,s t < M,x 4 ,z 4 ) = (1 - 9!) ]T g <[s ^ + 1*6*, 

Si =Q l^s^O^Si 

where n Si = exp(-A.;)A- i / Si \. 




3.4. Estimation. The ZIP randomized response regression model (12) 
(as well as the other models presented in this section) can be estimated by 
maximizing the kernel of the observed-data log-likelihood 

lnf (/3, 7 |s*,X,Z) 

n ( M 
i=l V Si=0 Z^s i =0 7r s i 

with respect to the parameters (3 and 7. We have written code for the quasi 
Newton-Raphson procedure QNewtonmt of the statistical software program 
GAUSS to estimate the model parameters. The procedure uses the BFGS 
method with numerically computed gradients and Hessian matrix, and stan- 
dard errors are obtained from the inverse of the estimated Hessian. Conver- 
gence is generally fast, but due to machine imprecision problems may be 
encountered with the inversion of the Hessian. The use of slightly different 
starting values usually solves this problem. The observed-data likelihood is 
convex and unimodal when evaluated as a function of the parameters 6 and 
A. Figure 2 shows the shape likelihood function for the ZIP randomized 
response model 9 given the social security data. This model does not in- 
clude any predictors for the parameters A and 8, and the likelihood function 
is evaluated for the SP parameter € (0,0.25) and the Poisson parameter 
A 6 (0.25,0.75). 



3.5. The Poisson assumption. It is a well known statistical result that for 
M> 1, 7r< 1 and Mir ~ 1, the distribution of the sum of M i.i.d. Bernoulli 
variables with success probability ir is approximated by a Poisson distri- 
bution with parameter A = Mir. The Poisson randomized response models 
presented in this paper are based on the assumption that the five randomized 
response variables Y m are Bernoulli variables and that the sum of these vari- 
ables follows a Poisson distribution with parameter A = J2m=i n im > where 
7r lm denotes the success probability of variable Y m (i.e., the prevalence of 
the sensitive characteristic). In this section we evaluate the validity of this 
assumption, given that in our example M is relatively small and that the 
success probabilities ix\ m are not identical for different m. 
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Fig. 2. Likelihood function for the ZIP randomized response model evaluated for the SP 
parameter 8* 6 (0,0.25) and the Poisson parameter A £ (0.25,0.75). 



Table 1 








1 


2 


3 


4 


5 


Exact distribution 
Poisson approximation 


0.7250 
0.7401 


0.2498 
0.2228 


0.0244 
0.0335 


0.0008 
0.0033 


0.0000 
0.0002 


0.0000 
0.0000 



To evaluate the adequacy of the Poisson assumption, we first derive 
the exact distribution of the sum of five independent Bernoulli variables 
with success probabilities equal to the prevalence estimates 7ri m of the five 
variables Y m of the social security survey. The prevalence estimates ob- 
tained with the multinomial randomized response model (1) are 7fi 1 < 0.001, 
7Ti 2 = 0.050, vfi 3 = 0.009, 7?i 4 = 0.069 and 7r ll = 0.172, with 7f ll set equal to 
0.001. We then approximate this distribution by a Poisson distribution with 
^ = Em=i ^im = 0.301. The two distributions are shown in the Table 1. 

The Poisson approximation assigns more mass to the zero count and to 
counts larger than 2, and thus overestimates the true variance. It underes- 
timates the probability of count 1 by 0.0270, which corresponds to a rela- 
tive difference of approximately 11%. In view of the fact that the absolute 
deviations in probability of the remaining counts are smaller, the Poisson 
approximation seems satisfactory for all practical purposes. 
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4. Analysis of the social security data. Table 2 presents fit indices for the 
multinomial randomized response model (M), the Poisson randomized re- 
sponse model (V), the ZIP randomized response null-model (Zq), the model 
Z/3 including the five demographic predictors of the Poisson parameter, and 
the full model Z~ q with the additional two predictors of SP. The table 
reports the loglikelihood, the Akaike Information Criterion (AIC) given by 
2k — 2ln£* , the Bayesian Information Criterion (BIC) given by k Inn — 2 In £*, 
with ln£* the maximized loglikelihood and k the number of independently es- 
timated parameters. For the models without predictors, we present the Pear- 
son chi-square statistic X 2 with df = M — k, where M denotes the number 
of independently observed sum score frequencies. The last column of Table 
2 presents the SP probability estimates 9* = (Ya=i I(S*=o))~ 1 Yh=i I(s*=o)@i 
for the three ZIP models. 

Although model M. is saturated, the fitted response frequencies no = 
272.0, n\ = 319.1, n 2 = 195.3, n 3 = 66.7, n 4 = 12.6, n 5 = 4.3 do not equal the 
corresponding observed response frequencies. The fact that X 2 is nonzero 
with zero degrees of freedom indicates that one or more of the estimates are 
on the boundary of the parameter space [van den Hout and van der Heijden 
(2002)]. The expected distribution of the true sum score variable S is 

tt(A4) = (0.878,0.000,0.116,0.000,0.000,0.006), 

with (near) zero-probability estimates for one, three and four rule violations. 
An interesting result is that the probability estimate of 0.6% for five rule 
violations is inconsistent with the fact the smallest univariate prevalence 
estimate of regulatory noncompliance is only 0.1% (see Section 3.5). 

Model V clearly does not fit well, indicating that for our application the 
Poisson assumption does not hold. SP is introduced in model Zq with an 
estimated probability of 12.6%. This model fits substantially better and 
is the best model in terms of BIC. The Pearson chi-square of 19.6 with 
4 degrees of freedom, however, indicates lack of fit. The fitted frequencies 
n = 287.2, ni = 298.9, n 2 = 199.5, n 3 = 70.1, n 4 = 13.3 and n 5 = 1.1 show 



Table 2 

Loglikelihoods, AIC's, BIC' s and Pearson X 2 statistics, and SP estimates 9* 





Model 


Loglik. 


AIC 


BIC 


fc 


X 2 


df 


0* 


M 


Multinomial 


-1170.8 


2351.6 


2375.4 


5 


6.1 







V 


Poisson 


-1183.3 


2368.6 


2373.4 


1 


56.0 


5 




Z 


ZIP (null) 


-1173.2 


2350.5 


2360.0 


2 


19.6 


4 


0.126 


Zp 


ZIP (incl. 0) 


-1167.0 


2348.1 


2381.5 


7 






0.124 


Z-y,0 


ZIP (full) 


-1165.0 


2348.0 


2391.0 


9 






0.121 
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that the lack of fit is primarily due to the underestimation of n^, this cell 
contributes about 80% (14.6) to the total X 2 value. In terms of AIC the 
models Zp and Z^ t p fit best. Both models estimate the SP probability a 
little above 12%. The best model is Z^ t p, with the marginal distribution of 
the fitted values of Si given by 

7f(2 7>/ 3) = (0.657,0.267,0.063,0.011,0.002,0.000). 

The AIC and BIC disagree with respect to model choice. Since randomized 
response requires much larger samples than direct question designs and the 
BIC punishes for sample size, we feel that the BIC might be too conservative. 
Therefore, we prefer the model with the lowest AIC, which is Z~p. This 
choice is further motivated by the fact that the AIC decreases to 2343.4 
when the four nonsignificant regression parameters in this model (see Table 
3) are set to zero. In this case the BIC becomes 2367.2, so that according to 
this criterion, Zq remains the preferred model. The disagreement between 
the two criteria indicates that the evidence for the effects of the predictors 
in model Z~ a is not strong. 

Table 3 presents the parameter estimates of the predictors in model Z^ t p. 
The upper part of the table shows the results for the predictors in the vec- 
tor x. The last column reports the effect size exp(/3), expressing the relative 
change in the Poisson parameter for a unit change in the predictor. (For con- 
tinuous variables, the standardized effect size can be computed by raising 
the reported effect size to the power of the standard deviation of the pre- 
dictor.) The variables year unemployed and knowledge rules are significant 
predictors of the Poisson parameter. Regulatory noncompliance increases af- 
ter the first year of unemployment; the estimated number of rule violations 
for a person unemployed longer than 1 year is 1.78 times that of a person 
unemployed less than 1 year. Better knowledge of the rules is associated 
with lower levels of regulatory noncompliance; the standardized effect size 
of 0.78 denotes the factor change in the Poisson parameter for each standard 
deviation increase in the score on knowledge rules (sd = 0.90). 

The lower part of Table 3 reports the parameter estimates for the predic- 
tors in the vector z. The last column reports the effect size exp(7), which 
expresses the relative change in the odds of SP for a unit change in the 
predictor. The parameter estimate for the variable understanding is signif- 
icant. Better understanding of the forced response method results in less 
self-protective responses; the standardized effect indicates that the odds of 
SP decrease by approximately two-thirds (0.67) for each standard deviation 
increase in the score of understanding (sd = 0.85). 

In order to assess the fit of model more closely, we evaluate the corre- 
spondence between the observed and fitted frequencies within the response 
categories of each the predictor variables. Figure 3 plots the Pearson resid- 
uals (n s * Xjk — n s * x . k )/Jn s * x . k , with n s * Xjk denoting the observed frequency 
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Table 3 



Parameter estimates, standard 


errors (se 


), t-values and 


effect sizes for 


model Z lt p 


Predictors in x 




(se) 


t-val. 


exp03) 


Constant 


-0.13 


(0.38) 


-0.32 




Gender (female) 


0.21 


(0.22) 


0.95 


1.23 


Age (< 26) 


0.50 


(0.36) 


1.39 


1.65 


Education 


0.19 


(0.18) 


1.07 


1.21 


Year unemployment (< 2004) 


0.58 


(0.29) 


1.97 


1.78 


Knowledge rules 


-0.27 


(0.12) 


-2.34 


0.76 


Predictors in z 


7 


(se) 


t-val. 


exp(9) 


Constant 


-0.64 


(1.04) 


-0.61 




Trust 


0.14 


(0.33) 


0.43 


1.15 


Understanding 


-0.46 


(0.23) 


-1.99 


0.63 



of persons with sum score s* and score k on predictor j, and n s * Xjk denot- 
ing the corresponding fitted frequency. Because of the low frequencies of the 
observed sum scores 4 (n = 7) and 5 (n = 5), these two categories have been 
collapsed into the single sum score category 4/5. 

The upper three plots of Figure 3 do not show systematic patterns or 
outliers for the dummy variables. Only the category of respondents younger 
than 27 with sum score zero shows a moderately large (negative) residual, in- 
dicating that this group is slightly overestimated. The remaining four plots 
in the lower part of Figure 3 show large residuals for the predictor trust. 
The plot shows underestimation of respondents who have no trust in the 
randomized response design and who have an observed sum score of either 
zero or four or five, with an exceptionally large residual in the latter cate- 
gory (n = 3). Since the combination of no trust in the randomized response 
design and yes responses to (almost) all sensitive questions is somewhat 
counterintuitive, it suggests the presence of a response mechanism opposite 
to that of SP; it may be the case that there are (a few) respondents who do 
not trust randomized response and who therefore (almost) always answer 
yes, irrespective of the outcome of dice. Although this is only a tentative 
explanation for the large residual, it would be interesting to see whether 
a similar response mechanism could also be detected in other randomized 
response applications. 

The results presented in this section show evidence for the presence of SP 
in the data; the models with the SP parameter fit better than the other mod- 
els. The degree of freedom needed to estimate the SP parameter is gained by 
the Poisson assumption for the sum scores of regulatory noncompliance. In 
this application the Poisson models fit the data well, except for the under- 
estimation of the five cases with an observed sum score 5. Further research 
is needed to explore the nature of this misfit. 
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gender age year unemployment 



2 o 



o female fn=397j 

£ 1--J -A • ■!= ■■■'■?, 



1 1 1 1 — 

1 2 3 4/5 
observed sum score 



o 26orvQUnger (n^SB) 
£ 27 or Older tfi=S33} 




1 2 3 4/5 
observed sum score 



I 



° iti 2004 (rw257) 
IHfore 2004 (n=o13) 
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— i — i — i — i — 

1 2 3 *$ 
observed sum score 



education 
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- 4 2(m432> 
+ 3(n-326) 






■ 

o 


o 


1 1 1 

1 2 


1 1 

3 4/5 



o 1 (n=10) 

a 2(n.eo) 

+ 3(n=162) 
4(n=451) 
<J 5(11=187) 






:> 





OUSerVKl Sum SWre 



n — i — i — T 



ot>serv«i si'" 1 score 




gDServed Sum Score 



1 2 3 4/5 

oUseives" sum score 



Fig. 3. Pearson residuals given predictor score and observed sum score (with sum scores 
4 and 5 collapsed into single category). 



5. Discussion. In this paper we introduce a zero-inflated Poisson regres- 
sion model for the analysis of randomized response sum score data. The 
central assumption underlying the model is that the true sum score vari- 
able follows a Poisson distribution, and that the presence of SP results in 
a zero-inflated distribution of the observed sum score variable. We present 
an example with a randomized response sum score variable assessing non- 
compliance with social security regulations. The ZIP randomized response 
model is used to find (1) the probability distribution of regulatory noncom- 
pliance, (2) the probability of SP, (3) significant predictors of regulatory 
noncompliance and (4) significant predictors of SP. 

From a substantive point of view, the ZIP randomized response regres- 
sion model yields some interesting results. For officials at the Department 
of Social Affaires, the negative effect of rule knowledge on regulatory non- 
compliance, suggesting that noncompliance is to a certain extent due to 
ignorance, may be of assistance in the formulation of new policies. The 
negative association between understanding of the forced response design 
and the probability of SP is especially interesting to social scientists in- 
terested in randomized response method. This result, that coincides with 
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the conclusions of a study of psychological aspects of randomized response 
[Landsheer, van der Heijden and van Gils (1999)], suggests that adjustments 
in the instructions that would help the respondents to better understand the 
forced response design may reduce response bias and thereby enhance the 
validity of the responses. 

The central assumption of the model that the true sum score variable 
is generated by a Poisson process is questionable since (1) the number of 
Bernoulli (i.e., binary randomized response) variables making up the ran- 
domized response sum score variable is limited and (2) the success proba- 
bilities (i.e., the prevalence of the sensitive characteristics) are not identical. 
Based on the univariate prevalence estimates, we demonstrate that in our 
example the Poisson approximation is satisfactory. An evaluation study with 
manipulation of the numbers of Bernoulli variables and of success probabil- 
ities (not reported here) shows that the quality of Poisson approximation is 
most affected if one (or more) of the success probabilities becomes larger. 
This finding is corroborated by the more formal result of Serfling (1978) that 
the absolute deviations between a series of Bernoulli variables with different 
success probabilities and its Poisson approximation increase as a function 
of the squared success probabilities. Although it is difficult to give exact 
figures, we feel that the Poisson assumption is justified as long as the preva- 
lence estimates of the binary randomized response variables do not exceed 
0.25. Since randomized response deals with sensitive characteristics that are 
rare by definition, the risk of this happening should be small. 

SUPPLEMENTARY MATERIAL 

The social security survey data (doi: 10. 1214/07- AOAS135SUPP; .pdf). 
The survey was conducted in 2004 by the Dutch Department of Social Af- 
faires amongst 870 social security beneficiaries. The data contain the re- 
sponses to five randomized response items assessing noncompliance with 
social security regulations, and seven background variables. 
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